CDS 


TECHNICAL  MEMORANDUM  NO.  CIT-CDS  93-015 

August  30, 1993 


"The  Rank  One  Mixed  \x  Problem  and 
'Kharitonov-Type'  Analysis" 

Peter  M.  Young 


Control  and  Dynamical  Systems 

California  Institute  of  Technology 

Pasadena,  CA  91125 


Report  Documentation  Page 

Form  Approved 

OMB  No.  0704-0188 

Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 

VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  OMB  control  number. 

1.  REPORT  DATE 

30  AUG  1993  2  REPORT  TYPE 

3.  DATES  COVERED 

00-00-1993  to  00-00-1993 

4.  TITLE  AND  SUBTITLE 

The  Rank  One  Mixed  u  problem  and  ’Kharitonov-Type’  Analysis 

5a.  CONTRACT  NUMBER 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHOR(S) 

5d.  PROJECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

California  Institute  of  Technology, Control  and  Dynamical 

Systems, Pasadena, C  A, 91 125 

8.  PERFORMING  ORGANIZATION 

REPORT  NUMBER 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

10.  SPONSOR/MONITOR'S  ACRONYM(S) 

11.  SPONSOR/MONITOR'S  REPORT 
NUMBER(S) 

12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release;  distribution  unlimited 

13.  SUPPLEMENTARY  NOTES 

14.  ABSTRACT 

15.  SUBJECT  TERMS 

16.  SECURITY  CLASSIFICATION  OF:  17.  LIMITATION  OF 

18.  NUMBER  19a.  NAME  OF 

a.  REPORT  b.  ABSTRACT  c.  THIS  PAGE 

unclassified  unclassified  unclassified 

24 

Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSI  Std  Z39-18 


The  Rank  One  Mixed  fi  Problem  and 
“Kharitonov- Type”  Analysis 

Peter  M.  Young 

Electrical  Engineering,  116-81 
California  Institute  of  Technology 
Pasadena,  CA  91125,  U.S.A. 


Abstract 

The  general  mixed  p  problem  has  been  shown  to  be  NP  hard,  so  that  the  exact  solution  of 
the  general  problem  is  computationally  intractable,  except  for  small  problems.  In  this  paper  we 
consider  not  the  general  problem,  but  a  particular  special  case  of  this  problem,  the  rank  one 
mixed  p  problem.  We  show  that  for  this  case  the  mixed  p  problem  is  equivalent  to  its  upper 
bound  (which  is  convex),  and  it  can  in  fact  be  computed  easily  (and  exactly).  This  special  case 
is  shown  to  be  equivalent  to  the  so  called  “affine  parameter  variation”  problem  (for  a  polynomial 
with  perturbed  coefficients)  which  has  been  examined  in  detail  in  the  literature,  and  for  which 
several  celebrated  “Kharitonov-type”  results  have  been  proven. 


1  Introduction 

It  is  now  known  that  the  general  mixed  p  problem  is  NP  hard,  and  this  strongly  suggests  that  the 
exact  solution  of  the  general  problem  is  computationally  intractable,  except  for  small  problems  [3]. 
In  this  paper  we  consider  not  the  general  problem,  but  a  particular  special  case  of  this  problem, 
the  rank  one  mixed  p  problem.  The  rank  one  mixed  p  problem  is  that  of  computing  pjc(M)  for 
M  =  uv*  with  u,  v  G  Cra. 

Note  that  imposing  the  condition  that  M  be  a  dyad  is  a  strong  restriction,  and  this  will  limit 
the  applicability  of  this  analysis  to  real  engineering  applications.  The  reason  for  our  interest  in 
this  particular  problem  is  that  it  turns  out  that  a  special  case  of  the  rank  one  mixed  p  problem  is 
equivalent  to  the  so  called  “affine  parameter  variation”  problem  (for  a  polynomial  with  perturbed 
coefficients)  which  has  been  examined  in  detail  in  the  literature,  and  for  which  several  celebrated 
“Kharitonov-type”  results  have  been  proven.  These  results  provide  exact  robust  stability  tests  for 
such  problems,  with  respect  to  real  parametric  uncertainty  (see  [12]  and  the  references  therein). 

It  will  be  seen  that  this  special  p  problem  does  indeed  avoid  the  NP  hardness  issues  of  the 
general  problem.  We  will  show  that  for  such  problems  p  equals  its  upper  bound,  which  is  a  convex 
problem.  In  fact  it  is  not  even  necessary  to  solve  a  convex  optimization  problem,  and  we  are  able 
to  obtain  a  complete  solution  to  the  rank  one  problem,  in  terms  of  quantities  which  are  easily 
computed.  This  solution  also  provides  us  with  the  means  to  examine  the  properties  of  the  rank 
one  problem,  and  arrive  at  a  number  of  interesting  results  (for  example  an  “edge”  result  holds  for 
rank  one  mixed  p  problems). 
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2  Notation  and  Preliminaries 


The  notation  used  here  is  fairly  standard  and  is  essentially  taken  from  [6]  and  [15].  For  any  square 
complex  matrix  M  we  denote  the  complex  conjugate  transpose  by  M*.  The  largest  singular  value 
and  the  structured  singular  value  are  denoted  by  a(M)  and  pic(M)  respectively.  The  spectral 
radius  is  denoted  p(M )  and  pr(M )  =  max{\\\  :  A  is  a  real  eigenvalue  of  M},  with  Pr(M)  =  0 
if  M  has  no  real  eigenvalues.  For  a  Hermitian  matrix  M,  then  A (M)  and  A min(M)  denote  the 
largest  and  smallest  (real)  eigenvalues  respectively.  For  any  complex  vector  x,  then  x*  denotes  the 
complex  conjugate  transpose  and  |®|  the  Euclidean  norm.  We  denote  the  k  X  k  identity  matrix  and 
zero  matrix  by  Ik  and  Ok  respectively. 

The  definition  of  p  is  dependent  upon  the  underlying  block  structure  of  the  uncertainties,  which 
is  defined  as  follows.  Suppose  we  have  a  matrix  M  6  CraXn  and  three  non-negative  integers  mr, 
mc,  and  mp  (with  m  :=  mT  +  mc  +  me  <  n)  which  specify  the  number  of  uncertainty  blocks  of 
each  type.  Then  the  block  structure  IC(mr,mc,mc)  is  an  m-tuple  of  positive  integers 


K,  —  (Aq,  •  •  • ,  kmr ,  k . 


mr  i  ",mr+ 1  *  ■ 


•  j  kn 


mr+mc  i  kmT+mc+l  ? 


■  >  km) 


(1) 


This  m-tuple  specifies  the  dimensions  of  the  perturbation  blocks,  and  we  require  hi  =  n 

in  order  that  these  dimensions  are  compatible  with  M.  This  determines  the  set  of  allowable 
perturbations,  namely  define 


XK  =  {  A  =  block  diag(6[Ikl 6rmrIkmT ,  S^Ikmr+1 , . . . ,  bcmIkmr+mc ,  Af , . . . ,  A^c)  : 

$T  £  R,  Sf  G  C,  Ap  6  cfmr  +  mc  +  i  X^mr  +  mc  +  i  | 


(2) 


Note  that  X/c  €  CnX”  and  that  this  block  structure  is  sufficiently  general  to  allow  for  (any  com¬ 
bination  of)  repeated  real  scalars,  repeated  complex  scalars,  and  full  complex  blocks.  The  purely 
complex  case  corresponds  to  mr  =  0,  and  the  purely  real  case  to  mc  =  me  =  0. 

Note  also  that  all  the  results  which  follow  are  easily  generalized  to  the  case  where  the  full  complex 
blocks  need  not  be  square,  and  the  blocks  may  come  in  any  order.  We  make  these  restrictions  in 
(2)  purely  for  notational  convenience. 

Definition  1  ([5])  The  structured  singular  value,  gtc(M),  of  a  matrix  M  £  CnXn  with  respect  to 
a  block  structure  IC(mr,mc,mc)  is  defined  as 

Pk(M)  =  ^mm{ff(A)  :  det(I  -  AM)  =  0}^  (3) 

with  pic(M)  =  0  if  no  A  E  Xjc  solves  det{I  —  AM)  =  0. 

In  order  to  develop  the  relevant  theory  we  need  to  define  some  sets  of  block  diagonal  scaling  matrices 
(which,  like  p  itself,  are  dependent  on  the  underlying  block  structure). 

BX,c  =  {AEXK:  a( A)  <  1}  (4) 

Qk  =  {A  e  :  sr  £  [-1  1  =  l,Af-Ap  =  4„r+,e+,}  (5) 


Vic  =  {block  diag{^6lD\,. . . ,  e^0mr Dmr,D 


mr+li  •  •  •  5  Dmr+mci  "'i-‘«mr+mc+ 1  ’ 
7T  7 T 


;  1  &\Ikn 


•  i  dmc 


^ km  )  • 


°i  e  l- 2  2^°  <  D 


(6) 
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VK  —  { block  diag(Di, . . . ,  Dmr+mc,  di/fcmr+mc+1  dmcIkrn)  : 

Dt  =  D*  G  Ck‘xk',dt  G  R}  (7) 

VK  =  {D  G  Vk,  :  D  >  0}  =  {D  G  VK  :  0{  =  0,  i  =  1, . . . ,  mr}  (8) 

£jc  =  {Woe*  , GW,  Ofemr+1 , . . . ,  OkJ  :  Gt  =  G*  G  Cki*ki}  (9) 

We  introduce  one  further  piece  of  notation.  Suppose  M  G  CnXn  has  an  eigenvalue  A  with  right  and 
left  eigenvectors  x  and  y  respectively.  Then  partition  x  and  y  compatibly  with  the  block  structure 
as 


where  xr> , yr%  G  Ck\  xCi,yCi  G  Xd,yCi  G  Ckmr+m^+‘.  These  will  be  referred  to  as  the  “block 

components”  of  x  and  y,  and  we  define  the  “non-degeneracy”  assumption  to  be  that  for  every  i  (in 
the  appropriate  set),  |y*  xri\  ±  0,  \y*txCi\  ±  0,  \yci\\xct\  £  0. 

3  “Kharitonov- Type”  Analysis 

Before  embarking  on  a  study  of  the  rank  one  mixed  g,  problem,  we  first  place  this  problem  in 
context,  by  considering  the  “affine  parameter  variation”  problem,  for  a  polynomial  with  perturbed 
coefficients.  This  formulation  of  the  “affine  parameter  variation”  problem  is  fairly  standard,  and  is 
taken  from  [10]. 

Consider  a  real  monic  polynomial  in  the  complex  variable  s,  whose  coefficients  are  affine  func¬ 
tions  of  a  real  vector  of  uncertainties,  k  G  Rm 

p(s,  k)  =  sn  +  a1(fc)sn_1  +  a2(fc)sn_2  +  . . .  +  an(k)  (11) 

where  a,i(k )  for  i  =  1, . . . ,  n  are  affine  functions  of  k,  i.  e. ,  there  exists  F  G  RnXm  and  g  G  Rm  such 
that 

a(k)  =  [ai(Ar)  a2(k)  ...  an(k)]T  =  Fk  +  g  (12) 

Thus  we  can  rewrite  this  set  of  polynomials  as 

p(s,k)  =  sn  +  [sn-1  sn~2  ...  l](Fk  +  g)  (13) 

Since  this  polynomial  will  typically  be  the  closed  loop  characteristic  polynomial  of  some  uncertain 
system,  we  will  say  that  it  is  stable  if  it  has  all  its  roots  in  the  open  left  half  plane.  We  assume 
nominal  stability,  i.  e. ,  p(s,  0)  is  stable.  Thus  in  order  to  check  robust  stability  we  can  show 
by  simple  continuity  arguments  that  it  suffices  to  check  that  the  polynomial  has  no  roots  on  the 
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imaginary  axis  for  any  k.  Assume  in  the  following  analysis  that  we  are  considering  points  on  the 
imaginary  axis,  i.  e. ,  s  =  ju>  where  w  G  R.  Note  that  since  we  have  nominal  stability  p(s,  0)  ^  0 
for  all  s  =  jw.  Thus  we  have 

p(s,  0)  =  sn  +  [sra_1  sn~ 2  ^  0  (14) 


Now  note  that  we  have  a  root  of  the  uncertain  polynomial  on  the  imaginary  axis  iff  for  some  s  =  }w 
and  some  k  G  Rm,  p(s,k)  =  0.  This  can  be  reformulated  as 


p(s,k)  =  0 


[sn-l  sn-2 

v*k  =  1 
1  —  v*k  =  0 


1  \Fk  =  -sn  -  [s71-1  s"~2  ...  1  ]g 


where  v  G  Cm  is  given  by 


But  now  define  the  quantities  A  G  RmXm  and  u  G  Rm  by 


A  =  diag(ki,...,km ) 
u  =  (1  ...  1)T 


1]* 


(15) 


(16) 


(17) 

(18) 


and  we  have  that  A u  =  k.  Thus  an  equivalent  condition  for  the  existence  of  an  imaginary  axis  root 
of  the  uncertain  polynomial  is  given  by 


1  —  v*k  =  0  < — »  1  —  v*A  u  =  0 

■* — *  det(Im  —  A  uv*)  —  0 

< — >  det(Im  -  AM)  =  0  (19) 

where  M  G  CmXm  is  the  dyad  M  —  uv*.  Checking  this  condition  is  exactly  a  rank  one  p  problem 
and  thus  we  see  that  the  “affine  parameter  variation”  problem,  for  a  polynomial  with  perturbed 
coefficients,  is  a  special  case  of  the  rank  one  mixed  p  problem  (with  only  real  perturbations). 

It  is  possible  to  consider  a  number  of  different  stability  problems  arising  from  this  set-up,  by 
allowing  for  different  stability  regions,  and  different  norms  to  measure  the  size  of  k.  We  will  only 
be  interested  in  the  case  where  stability  is  associated  with  all  the  roots  in  the  open  left  half  plane, 
and  we  use  |&|oo  =  maxt<m  \k{\  to  measure  the  size  of  k.  In  this  case  we  find  that  we  can  use  the 
standard  definition  of  p,  and  we  are  required  to  compute  the  peak  value  across  frequency  of  p,  for 
a  transfer  matrix  which  is  rank  one.  The  treatment  of  other  norms/regions  is  discussed  in  [4]. 

A  number  of  different  stability  results  have  been  presented  for  this  type  of  problem.  One  of  the 
strongest  motivations  for  pursuing  these  problems  was  provided  by  Kharitonov’s  celebrated  result 
for  “interval  polynomials”  [7].  This  is  a  special  case  of  the  above  setting  where  one  further  restricts 
the  uncertainty  description  to  be  of  the  form 


ai(k)  =  a,i  +  biki  for  i  =  1, . . . ,  m 


(20) 
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where  a *•  £  R,  &,■  6  R  and  m  =  n.  Thus  the  coefficients  of  the  polynomial  are  independent  of  each 
other,  and  known  only  to  lie  within  certain  intervals.  For  this  problem  it  was  shown  in  [7]  that 
one  need  only  check  four  specific  polynomials  to  establish  stability  of  the  whole  family.  This  is 
clearly  a  polynomial  time  computation,  and  we  have  restricted  the  problem  sufficiently  to  beat  the 
NP-hardness  of  the  general  problem.  In  doing  so  however  we  have  placed  quite  severe  restrictions 
on  the  allowable  problem  class,  and  so  the  applicability  of  the  result  to  engineering  problems  is 
rather  limited. 

If  we  consider  the  “affine  parameter”  case,  then  it  was  shown  in  [1]  that  it  suffices  to  check 
stability  of  the  edges  of  the  parameter  space,  i.  e. ,  we  may  take  every  element  of  k  except  one  to  be 
at  an  extremal  value  of  it’s  allowed  range.  Note  that  this  requires  checking  a  combinatoric  number 
of  edges,  so  that  even  this  increase  in  the  generality  of  the  problem  produces  dramatic  increases  in 
computation.  Exact  results  for  this  type  of  problem  typically  involve  checking  the  vertices  or  edges 
of  some  polytope  in  the  parameter  space,  and  hence  involve  exponential  growth  in  computation  (see 
[12]).  If  one  is  prepared  to  allow  a  frequency  search  then  this  exponential  growth  can  be  avoided 
(see  [10]).  This  can  also  be  seen  from  the  framework  we  will  develop  here,  since  these  problems  can 
all  be  tackled  as  rank  one  p  problems,  which  we  will  see  can  be  easily  solved. 

Thus  we  will  find  that  it  is  possible  to  develop  exact  robust  stability  tests,  in  the  p  or  polynomial 
frameworks,  for  this  type  of  problem.  Of  course  we  must  note  once  more  that  the  applicability  of 
this  rank  one  p  analysis  is  rather  limited,  and  the  fact  that  the  general  problem  is  NP  hard 
strongly  suggests  that  results  for  this  case  cannot  be  usefully  extended  to  the  general  case.  This 
is  the  reason  why  the  “Kharitonov-type”  analysis  methods  do  not  extend  to  the  more  general 
“multilinear”  or  “polynomial”  cases  (which  correspond  to  more  general  p  problems),  and  one  is 
forced  to  use  approximate  and/or  iterative  methods  (see  [11,  16]  for  example). 


4  Equivalence  with  the  Upper  Bound 


This  section  is  devoted  to  proving  the  main  result  of  this  paper,  which  is  that  for  a  rank  one  p 
problem,  the  upper  bound  always  achieves  p ,  regardless  of  the  block  structure.  A  preliminary 
version  of  this  result  was  proven  in  [14],  where  additional  assumptions  were  imposed  to  make  the 
proof  fairly  simple.  Here  we  will  not  make  any  such  assumptions  on  the  problem,  and  it  turns  out 
that  this  makes  the  proof  substantially  more  difficult.  First  we  recall  the  mixed  p  upper  bound 
from  [6],  stated  here  in  a  slightly  different  form. 


Theorem  1  ([6])  For  any  matrix  M  6  CnX”,  and  any  compatible  block  structure  1C,  suppose  a * 
is  the  result  of  the  minimization  problem 


a* 


mi  |mjn{a  :  ( M*DM  +  j{GM  -  M*G)  -  aD)  <  0} 


(21) 


then  an  upper  bound  for  p  is  given  by 


Ptc{M)  <  ^/max(0,a*) 


(22) 


Note  that  this  upper  bound  is  a  convex  minimization  problem  so  that  we  can  compute  the  global 
minimum,  and  algorithms  for  this  computation  have  been  developed  (see  [17,  2]). 
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Theorem  2  Suppose  we  have  a  rank  one  matrix  M  E  CnXn ,  then  for  any  block  structure,  1C, 
ptc(M)  equals  its  upper  bound  from  theorem  1. 

This  theorem  gives  us  the  means  to  compute  rank  one  p  problems  exactly, 
equivalent  to  consider  the  upper  bound  problem,  which  is  convex  (in  fact 
it  is  not  even  necessary  to  resort  to  convex  programming  methods  to  solve 
tackling  the  proof  of  this  theorem,  we  need  a  few  preliminary  results. 

Lemma  1  Suppose  we  have  matrices  A  E  C”x”  and  B  E  CnXn,  with  A  <  0.  Define  S  =  {x  E  Cn  : 
x* Ax  =  0,  | a: |  ^  0}.  Then  we  have  that  A  +  tB<  0  for  sufficiently  small  0  <  t  E  R  iff  either  S  =  0, 
or  x*Bx  <  0  for  all  x  E  S. 

Proof:  (— >)  Since  we  have  that  for  all  x  ^  0 

x*(A  +  tB)x  =  x*Ax  +  tx*Bx  <  0 

with  t  >  0,  then  if  for  any  x  ^  0  we  have  x*Ax  =  0,  for  that  x  we  must  also  have  x*Bx  <  0. 

(<— )  If  S  =  0  then  A  <  0,  and  so  A  +  tB  <  0  for  sufficiently  small  t  >  0  by  a  simple  continuity 
argument.  Suppose  instead  that  S  ^  0,  but  x*Bx  <  0  for  all  x  E  S.  Define  S  =  S  n  B  where 
B  =  {x  :  |x|  =  1}.  Then  S  C  S  is  compact  and  so  by  continuity  there  exists  a  set  V  D  S,  which  is 
open  in  B,  with  x*Bx  <  0  for  all  x  E  V.  Thus  we  have 

x*(A  +  tB)x  =  x*Ax  +  tx*Bx  <  0 

for  all  x  E  V,  for  any  t  >  0.  If  B\V  =  0  we  are  done  immediately,  so  assume  B\V  ^  0.  Now  B\V  is 
compact  and  so  both  x* Ax  and  x*Bx  achieve  maxima  on  B\V.  Suppose  we  have 


since  it  says  that  it  is 
we  will  see  later  that 
this  problem).  Before 


max  x*Ax  =  -a 

x  eB\v 

max  x*Bx  —  Q 
xeB\v 


with  a  >  0  since  /?\V  fl  <S  =  0.  If  /?  <  0  then  x*(A  +  tB)x  <  0  for  all  x  E  B\V  for  any  t  >  0  and 
we  are  done,  so  assume  /?  >  0.  Then  we  have 


x*(A  +  tB)x  —  x* Ax  +  tx*Bx  <  —a  +  t(3  <  0  for 


Thus  we  have  x*(A  +  tB)x  <  0  on  5\V  for  sufficiently  small  t  >  0.  Combining  this  with  our  earlier 
result  we  have  that  for  sufficiently  small  t  >  0,  x*(A  -f  tB)x  <  0  for  all  x  E  B,  or  in  other  words 
A  +  tB  <  0  .  □ 


Lemma  2  Suppose  we  have  a  rank  one  matrix  M  =  uv* ,  with  u,v  E  C",  and  a  block  structure  K. 
Then  there  exists  a  sequence  of  matrices  D1  E  T>^  such  that  the  following  is  true: 

1.  Define  u3  =  DJu,  v3  =  D3~1v  and  the  following  limits  exist: 


lim  u3  =  u 

j—>  OO 

lim  v3  =  v  (23) 

j—KX>  V  ' 
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2.  Partition  u,v,u,v  compatibly  with  the  block  structure  as  in  (10).  Then  we  have  that  the 
following  is  true: 


v*Mri  =  v*.uTi 

for 

*  =  1, . . .,  mr 

Kfilc,  =  V*.UCi 

for 

i  =  1 

T*Ct  AuCl  =  v*cAuc> 

for 

i  —  1, . . .,  me  for  any  A  £  c^+mc+ix^mr+mc+i 

(24) 


3.  With  this  notation  we  also  have  that 


Ki«ril  =  K.l2  =  K,|2 

TT  |2  _  1 2 


K«cJ  =  KI  = 

l®Cill«c«l  =  I«c,||mc,|  =  \va\2  =  \ucf 

4-  Define  M3  =  uJv3*  and  the  following  limit  exists 


j— >oo 


J— *00 


for  i  =  1, . . 

. ,  mT 

for  i  —  1, . . 

,,mc 

for  *  =  1,.. 

.,mc 

(25) 

=  M 

(26) 

,  vri  for  i  =  1 , . . 

. ,  mr. 

Suppose  first 

5.  We  have  M  =  u  v* ,  with  p,/c(M)  =  pic(M). 

Proof:  Consider  first  the  repeated  real  scalar  blocks,  i.e.  , 
that  for  some  i  in  this  range  we  have  that  |  v*.  uri  |  ^  0.  Then  we  have 

K«r,-|  =  7^ 

for  some  7,  9  6  R  with  7  >  0.  Thus  we  have  that 

v*  (e~'i9uri)  =  7  >  0 

and  so  by  lemma  4.2  from  [9]  we  can  conclude  that  there  exists  Z?,  =  DJ  >  0  such  that 

vr,  =  e_J’ 6  Dfuri 

Choose  D\  =  D{  for  all  j  and  so  for  this  block  we  have  for  all  j 

Kl  =  I  Dflvn\  =  \e~ieDiUri\  =  |A'Ur,|  =  |<| 

and  furthermore 


=  |<e  J euri\  =  \v*.Di  2vri\  =  I D{  Vj2  =  \vi.\ 


Finally  note  that 


«,  =  =  <,«r, 


For  this  block  then  properties  1, 2,  3  all  hold. 

Now  suppose  that  for  some  i  <  mr  we  have  \v*.uTi\  =  0,  or  in  other  words  vTi  orthogonal  to  uri 
(if  we  have  |ur<|  =  0  and/or  \ur>  \  =  0  then  see  the  treatment  for  the  full  complex  blocks  later).  Thus 
we  can  choose  a  Hermitian  positive  definite  matrix  D\  with  uri  as  an  eigenvector  corresponding  to 
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the  smallest  eigenvalue  of  D\,  and  vTi  as  an  eigenvector  corresponding  to  the  largest  eigenvalue. 
Now  simply  choose  such  D\  with 

A(J D?)  t  oo  as  j  t  oo 

I  o  as  j  t  OO  (27) 

and  we  have  that 

lim  \v3r.\  =  lim,_,.oo  \D\~1vri  |  =  0 

j-*oo 

lim  \u3r.\  =  limj_oo  \Dluri\  =  0  (28) 

j~* oo  * 

so  that  properties  1,2,3  hold  once  more.  The  treatment  for  the  repeated  complex  scalar  blocks 
(i.  e. ,  uCi  ,vCi  for  i  =  1, . . . ,  mc )  is  identical. 

Now  consider  the  full  complex  blocks,  i.e. ,  uc,-,  for  i  —  1, . .  .,mc.  First  consider  a  block 
where  (for  some  i  in  the  appropriate  range)  | vc,  1 1 uq,  |  ^  0.  Choose  the  scalar  di  as 


d{  — 


(29) 


and  choose  D\  =  dilkmr+tnc+i  for  all  j.  Then  for  this  block  we  have  that 


I  uc,  1 2  =  I  diUCi\2  =  df\uCi  |2 
WCi?  =  \j*Ci\ 2  =  jy M 2 


=  j^jlud2  = 


and  furthermore  for  any  A  6  Ckm’-+m^+'xkmr+rn^+'  we  have 


vCiAuCi 


(I, 

Wt 


V 

I 

"V 


A  ^  rl -1 


-L-  .  V 


=  V, 


a  . 


£V 


*Ci 


(30) 


Note  that  this  implies 

K.lkc.l  =  KJIttCjl 

so  that  properties  1, 2, 3  all  hold  for  this  block. 

Suppose  now  that  for  some  i  we  have  |  vct  1 1  uc,  |  =  0,  and  hence  \uc{\  —  0  and/or  [  vct  |  =  0.  If 
both  are  zero  simply  choose  D\  =  ffcmr+mr+,  for  all  j,  and  it  is  easy  to  check  that  properties  1,2,3 
all  (trivially)  hold.  If  not  then  choose  D\  =  dJiIiCmr+rnc+i  where  rf-  is  chosen  to  satisfy 


dj  t  oo  as  j  |  oo  if  J’U.Oj  I  =  0 

d\  i  0  as  j  T  oo  if  1^1  =  0 


In  either  case  we  have  that 


•lim  \vc,\ 

j—*oo  * 

lim  \u3Ct\ 

J-+CO  1 


lim^co  \jjvcl\  =  0 
limj_.oo  \d\uci  |  =  0 


(31) 


(32) 
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so  that  properties  1,2,3  hold  once  more.  Thus  stacking  up  the  blocks  we  have  constructed  we 
obtain  our  D3  £  Vk,  satisfying  properties  1,2,3. 

Now  note  that 

M 3  =  u3v3*  =  D3uv*D3~1  =  D3MD3~ 1 

is  a  product  of  convergent  sequences,  u3  and  v3,  and  hence  converges,  so  we  have  property  4. 
Furthermore  by  standard  properties  of  limits  we  have  that  M  =  uv*.  Note  finally  that  employing 
properties  1, 2, 3, 4  we  obtain  that  for  any  A  £  Xk, 


det(In  -  AM)  =  det(In-Auv*) 


=  1  —  v*  Au 

mr 

mc 

mc 

=  1  -  6iV*t  Ur, 

-Y^cA^Ci 

2=1 

2=1 

2=1 

TT iy 

rac 

mc 

=  i  -  uri 

-  Y.  6iVc,  UCi 

-YVCiA?uCi 

2  =  1 

2=1 

i= 1 

=  det(In  -  AM) 


and  hence  prc{M )  =  wc(M),  which  is  property  5.  □ 


(33) 


Remarks:  Note  the  the  sequence  of  matrices  D3  £  X>£  satisfies  the  following: 


lim  a(D3MD3  *) 

j—HX> 


=  a(M)  —  |u||u|  =  |u|2 


mr  mc  me 

=  Y  k \2+Y  k  i2 + Y  i2 

2=1  2=1  2=1 

mT  mc  me 

=  £  I  %  uTi  i  +  uCi  l  +  \vCi  i  |wc,  i 
2  =  1  2=1  2=1 

mr  mc  me 

=  z2 \vri  ur>  i  +  Y, \v*> u*  i  + 12 \vci  i \uc> i 

i=l  i=l  i= 1 


(34) 


This  final  expression  is  exactly  fi  for  the  associated  complex  fi  problem.  Therefore  this  lemma 
proves  that  complex  /i  equals  its  upper  bound  for  rank  one  matrices  (which  is  well  known),  and  in 
the  process  we  explicitly  constructed  the  sequence  of  scaling  matrices  that  does  the  job.  Thus  the 
sequence  D3  £  V/c  is  exactly  the  optimal  scalings  from  the  upper  bound  of  the  associated  complex 
fi  problem. 

Note  that  this  lemma  provides  us  with  a  /i  invariant  transformation  from  M  to  M.  The  point  of 
carrying  out  this  transformation  is  that  property  3  implies  that  the  vectors  u ,  v  of  the  dyad  M  are 
perfectly  balanced  in  the  sense  that  each  sub  block  of  u  has  the  same  norm  as  the  corresponding 
sub  block  of  v.  Consequently  we  have  that  for  each  sub  block  of  u  and  v  we  either  satisfy  the  non 
degeneracy  assumptions,  or  the  corresponding  sub  blocks  of  u  and  v  are  both  identically  zero. 


Theorem  3  Suppose  we  have  M  =  uv*  with  u,  v  £  Crt  .  and  a  compatible  block  structure  1C,  with 
pic(M)  >  0.  Partition  u,v  with  respect  to  this  block  structure,  and  assume  that  for  each  sub  block 
of  u  and  v  we  either  satisfy  the  non  degeneracy  assumptions,  or  the  corresponding  sub  blocks  of  u 
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and  v  are  both  identically  zero.  Then  there  exist  matrices  Q  G  Qk  and  Q  G  Qk  with 


{1  if  i  =  j  <  mr  and  Qij  =  0 
Qij  otherwise 


(35) 


together  with  matrices  Dr  6  T>k,Dr  >  0  and  Di  G  Qk,  and  a  real  scalar  ip  G  (-§  f )  such  that 


v  =  e?*  (DrQ  +  iDiO)  u 


(36) 


with  0  <  v*Qu  —  /j,k(M). 

Proof:  The  proof  of  this  theorem  is  essentially  an  application  of  the  same  machinery  used  to  prove 
theorem  2  in  [15].  First  suppose  that  pk(M)  =  1.  Then  by  theorem  1  in  [15]  there  exists  Q  G  Qk 
with  pr(QM)  =  1.  Since  we  may  absorb  a  factor  of  ±  into  Q  we  may  assume  that  the  eigenvalue 
achieving  pr(QM)  is  positive.  But  note  that  this  implies 

0  =  det(In  —  QM)  =  det(In  —  Quv*)  —  1  -  v*Qu  (37) 


and  hence  0  <  v*Qu  —  1  =  pk(M). 

Since  M  is  rank  one,  so  is  QM,  and  hence  QM  has  at  most  one  non-zero  eigenvalue  (not 
repeated).  Thus  the  eigenvalue  at  one  is  distinct  and  furthermore  we  have  that 

QM(Qu)  =  Quv*Qu  =  ( Qu ) 

v*QM  =  v*Quv*  =  v*  (38) 


Since  Qu  and  v  are  both  non- zero  this  implies  that  they  are  the  right  and  left  eigenvectors  of 
QM  corresponding  to  the  unity  eigenvalue,  and  furthermore  they  are  normalized  with  v*(Qu )  =  1. 
Since  this  eigenvalue  is  distinct  we  can  differentiate  it,  and  applying  the  machinery  from  [15]  to 
M  =  QM  we  can  derive  that  the  following  relations  hold 


Reie^qlvp'Ur,)  >  0 
Re(e3^  qriV*.uri)  =  0 
e^q-v*^  e  [0  oo) 

Re(e^v*c.GfQfuCi)  <  0 


,  ?  —  1 , . . . ,  m,. 

,  if  i  <  mr  and  \q]\  <  1 

,  t  =  l,...,mc 

,  for  all  GQ  G  C^mr+mc+' x^mr+mc+' 
with  Gf  +  Gf*  <0,  i  =  1, . . . ,  me 


(39) 


for  some  ip  G  (— §  § ).  Now  we  may  apply  lemmas  4.1  and  4.2  from  [9]  to  each  sub  block  to  obtain 


»rj  =  ,  0  <  Di  =  D*,  0i  G  [--  ^],  i  <  mT  and  |g[|  =  1 


vTi  =  e^e^e'qriDiUri  ,  0  <  =  D*,  =  ±^,  i  <  mr  and  0  <  |g[|  <  1 


vri  =  e^e^6' DiUri  ,  0  <  Di  =  D*,  0i  =  i  <  mr  and  \q^\  =  0 


7 r 


Vci  —  q%  Dnir+iUci  ,  0  <  Dmr+i  —  R^rnr-\-%i  *  —  !>•••» 


mr 


VC, 


,  =  e^diQfud  ,  0  <  di  G  R,  *  =  1, .  ..,mc. 


(40) 
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Note  that  in  order  to  apply  these  lemmas  we  need  to  assume  that  the  non- degeneracy  assumptions 
are  satisfied  for  that  sub  block.  However  we  have  assumed  at  the  outset  that  u,  v  either  satisfy  the 
non- degeneracy  assumption  for  a  given  sub  block,  or  have  both  sub  blocks  of  u  and  v  identically 
zero,  in  which  case  the  above  relationships  hold  trivially.  Applying  this  argument  to  v  and  Qu  we 
get  that  the  above  relationships  hold  for  every  block.  The  only  case  where  this  argument  breaks 
down  is  for  the  repeated  real  scalar  blocks  with  qj  =  0.  For  these  blocks  however  we  can  show  that 
the  above  relationships  hold  by  a  simple  geometric  argument. 

All  that  remains  now  is  to  define  the  appropriate  quantities.  Define  Q  £  Qg  directly  from 
Q  G  Qk.  via  (35).  For  i  <  mT  note  that 

ej6'  -  cos(04)  +  j  sin(Oi) 
so  that  we  may  split  each  scaling  matrix  as 

ej0iDi  =  cos (Oi)Di  +  j  sm(0i)Di  =  Dr{  +  _/£>/,  for  *  =  1, . . . ,  mr  (41) 

For  the  complex  blocks  we  simply  define 

DRmr+i  =  Dmr+i  for  i  =  1, . . . ,  mc 

dR{  =  di  for  i  =  l,...,mc  (42) 

Now  stack  these  definitions  up  to  define  Dr  £  Vic  and  D]  £  Q/c-  Since  for  each  Si  we  have  that 
di  G  [—§§],  it  follows  that  cos(0;)  >  0  and  hence  Dr  >  0.  It  is  now  easy  to  verify  that  with  these 
definitions  the  relationships  in  (40)  may  be  written  in  matrix  form  as  v  =  ( DrQ  +}DjQ')  u. 

This  proves  the  result  for  mc{M)  =  1.  The  result  for  p>c(M)  >  0  follows  immediately  from  this  by 
simply  scaling  u  so  that  —  1,  applying  the  result  for  pk(M )  =  1,  and  then  reabsorbing  the 

scaling  back  into  Dr  and  Di.  □ 

Remarks:  Although  it  appears  at  first  sight  as  a  rather  unmotivated  mathematical  abstraction,  we 
will  see  that  in  fact  this  alignment  condition  between  v  and  u  is  the  key  to  the  equivalence  between 
fj,  and  it’s  upper  bound.  Note  also  that  we  have  such  an  alignment  for  any  Q  £  Q/c  achieving  a 
local  maximum  of  pr(QM)  over  Q  £  BX/e  with  pr(QM )  >  0.  This  follows  since  we  we  derived  the 
alignment  condition  simply  from  stationarity  conditions,  and  did  not  use  the  fact  that  pic(M)  is 
the  global  maximum  at  all. 

Note  that  the  conditions  on  u  and  v  assumed  in  theorem  3  are  exactly  those  guaranteed  for  the 
transformed  vectors  in  lemma  2.  Thus  by  first  transforming  the  dyad  as  in  lemma  2  we  can  show 
that  (for  the  transformed  dyad)  we  always  have  an  alignment  condition  as  in  theorem  3,  without 
requiring  any  type  of  non- degeneracy  assumptions  (except  that  pic(M)  >  0). 

We  are  now  in  a  position  to  combine  these  results  to  prove  the  main  result. 

Proof  of  Theorem  2:  We  will  consider  separately  the  cases  p,/c(M)  >  0  and  pic(M)  —  0.  First 
suppose  that  p>c(M)  >  0  and  for  the  moment  further  assume  that  in  fact  p/c(M)  =  1.  Carry  out  the 
transformation  of  lemma  2  to  define  u,  v,  M  with  ptc(M )  =  1.  Now  u,v,  M  satisfy  the  assumptions 
of  theorem  3,  so  we  may  apply  this  theorem  to  conclude  that  we  have  matrices  Q  £  Qic  and 
Q  £  Qk  with  Q  as  in  (35),  together  with  matrices  Dr  £  T>ic,Dr  >  0  and  Dj  £  Qyc,  and  a  real 
scalar  ^  £  (— ^  -|)  such  that 

v  =  ^  (DrQ  +  j DiQ}  u  (43) 
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with  v*Qu  =  1.  Note  that  this  implies 

v*Qu  =  e-J *u*(Q*DR  -  j Q*Di)Qu  =  1 

and  hence 

u*Q*DrQu  -  j =  cos(^)  +  j  sin(^)  (44) 

Now  we  have  immediately  that  0  <  u*Q*DrQu  E  R  and  furthermore 

u*Q*DiQu  =  u*Q*QDiu  =  u*Q*QDju  =  u*Q*DjQu 

so  that  u*Q*DiQu  E  R  as  well,  and  hence  by  comparing  real  and  imaginary  parts  in  (44) 

cos(^)  =  u*Q*DrQu  =  u*Q*QDru  —  u*  Dru  (45) 

Define  G  E  Qk  as  G  =  -DjQ,  and  substituting  Dr  and  G  into  the  upper  bound  expression  we 
obtain 

M*DrM  +  j  (GM  -  M*G)  -  Dr 
=  vu*DRUv*+j(—DiQuv*  +  vu*DiQ)  —  DR 

=  -v  v*(u*Dru)  -  Dr  +  DrQu  v*  +  v  u*Q*Dr  (46) 

where  we  have  made  use  of  the  substitution  -j DiQu  =  -e~^v  +  DrQu  and  (45).  Thus  for  any 
r]  E  C”  we  have  that 

t 7*  (m*DrM  +}(GM  -  M*G)  -  Dr)  rj 

=  2 Re  (( Ti*DRQu)(v*r ?))  -  ( u*Dru)\v*t]\ 2  -  tj*Drti 
=  -((v*r/)Qu-  r)T  DR((v*rj)Qu-  rj)  <  0  (47) 

Now  note  that  we  get  equality  (to  zero)  in  (47)  iff 

( v*r})Qu  —  rj  =  —x  (48) 

where  x  E  Ker(DR),  because  Dr  >  0.  But  this  implies 

v*r/  =  (v  *Qu)(v*tj)  +  v*x  —  v*rj  +  v*x 

and  hence  v*x  =  0.  From  this  it  is  easy  to  check  that 

rj  =  ( v*r])Qu  +  x  < — *  T)  =  aQu  +  x  for  some  a£C. 

To  summarize,  we  have  shown  that 

j?*  (~M* DrM  +  j(GM  -  ~M*G)  -  Dr)  •q  <  0  (49) 

with  equality  iff  r]  is  of  the  form 

r/  =  aQu  +  x  (50) 
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for  some  a  £  C  and  x  £  Cn  with  Drx  =  0  and  v*x  =  0.  Now  define  D  =  Dr  +  tln,  where  0  <  t  £  R 
will  be  chosen  later,  and  note  that  for  t  >  0,  D  £  Xbc-  Now  choose  any  e  >  0  and  we  have  that 

M*DM  +j(Gl  -M*G)-(l  +  e)D  =  (m*DrM  +  j(GM  -M*G)  -  Dr  -  eDR) 

+  t  (jrf  M  —  (1  +  e)/n) 

=  A  +  tB.  (51) 

We  wish  to  apply  lemma  1  to  (51).  Note  that  since  we  have 

M*DrM  +  j(GM  -H*G)-  Dr  <  0 

—cDr  <  0 

we  immediately  have  that  A  <  0,  and  furthermore  rf  Ar]  —  0  iff 

??*  ( 'WDrM  +  j {GM  -  M*G)  -  Dr )  t)  =  0 

V*DRr)  =  0 


From  our  earlier  results  this  implies 

r ]  =  aQu  +  x 

with  Drx  —  0,  v*x  =  0,  and  t]*DrT]  =  0.  But  since  Dr  >  0  this  implies  that  r]  6  Ker(DR),  and 
hence  aQu  £  Ker(DR).  Now  note  that  this  gives 

0  =  (aQu)*  DR(aQu)  =  |a|  2u*Q*DrQu  =  |a|2cos(^) 

Since  ^  €  (— f  f )  we  have  cos(^)  >  0,  and  so  this  expression  vanishes  iff  a  =  0.  Thus  we  obtain 
r]  —  x  and  so  we  have 

rf  Ar]  =  0  < — *  DrT]  =  0  and  v*  r\  =  0 

Now  note  that  for  such  t?  we  have 

rfBr]  =  +€)In)v 

=  T]*(v  u*uv*  -  (1  +  €)In)r] 

=  -(1  +  e)|f?|2  <  0  for  77  f  0  (52) 

Thus  applying  lemma  1  we  have  that  for  sufficiently  small  t  >  0,  A  +  tB  <  0  and  hence  choosing 
such  a  t  we  obtain 

M*DM +j(GM -M*G)~  (l  +  e)D  <  0  (53) 

with  D  6  Vx:  and  G  €  Gk-  It  now  remains  to  unwrap  the  transformation  from  M ,  to  recover  the 
result  for  M.  Recall  from  lemma  2  that  we  have  a  sequence  D3  £  V/c  such  that 

lim  D3MD3~ 1  =  M 

j— *■  OO 

Since  this  implies  that  we  may  choose  Dj  £  D/c  so  that  DjMDj~x  is  arbitrarily  close  to  M,  by 
continuity  we  may  choose  D 3  £  V/c  so  that 

(D3MD3-lyD(D3MD3-l)+i(G(D3MD3-l)-(D3MD3-l)*G)- (1  + e)D  <  0  (54) 
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Since  D3  >  0  we  may  multiply  both  sides  of  this  expression  by  D3  without  affecting  the  definiteness 
to  yield 


M*(D3DD3)M  +j((DjGD3)M  -  M*(D3GDj)-(l  +  e)(D3DD3)  <  0  (55) 


so  that  defining  D  =  D3DD3  and  G  =  D3GD3  we  have 


M*DM  +j(GM  —  M*G)  -  (1  +  e)D  <  0 


(56) 


with  D  G  T>tc  and  G  G  Qk-  But  this  shows  that  the  upper  bound  achieves  y/l+~e  where  e  >  0  was 
arbitrary.  Thus  by  taking  the  infimum  we  find  that  the  upper  bound  gives  1,  which  by  assumption 
equals  This  proves  the  result  for  mc(M )  =  1.  The  extension  to  wc(M)  >  0  is  easy. 

Suppose  H)c(M)  =  (3  >  0.  Then  =  1  so  we  may  apply  this  result  to  obtain,  for  any  e  >  0, 

D  G  V/c  and  Gefe  such  that 


M*  -  M  ..~M 

jdj+«g7 


M*  - 

jG)-(l  +  e)D  <  0. 


(57) 


Now  given  any  e  >  0  choose  c  =  (1  +  4)2  —  1,  and  choose  D ,  G  as  above.  Then  this  expression  gives 


M*DM  +  j ((/3G)M  -  M*{t3G ))  -  (/?  +  e)2D  <  0 


so  that  defining  D  =  D  and  G  =  (3G  we  obtain 

M*DM+j(GM-M*G)-(f3  +  e)2D  <  0 


with  D  6  Vtc  and  G  G  Qtc,  and  hence  by  the  same  reasoning  as  before  this  implies  that  the  upper 
bound  achieves  (3,  which  by  assumption  equals  This  proves  the  result  for  fi/c(  M)  >  0. 

The  only  case  we  have  left  to  deal  with  is  when  / i/c(M )  =  0.  This  case  is  not  covered  by  the 
previous  analysis  since  in  this  case  there  is  no  destabilizing  perturbation  ( Q  G  Qtc)-  Start  first  by 
employing  the  transformation  to  M  in  lemma  2.  If  a(M)  =  0  we  are  done,  so  assume  not,  i.e. , 


|*rlUll  -/  n  {rriTnoi'li  rvf  nlrr  fli  nf 

j££jj£/j  y~  U.  uxictt 


^11  - 

CXXX  LXiC 


|«cj|  =  |»cil  =  0  for  t=l,...,mc 

\uc,  |  =  \vc,  I  =  0  for  i  =  1 , . . . ,  me  (58) 

else  we  could  find  Q  G  Qk,  with  pr(QM)  >  0  simply  by  choosing  all  the  real  perturbations  to  be 
zero,  and  appropriate  complex  perturbations.  Furthermore  it  is  easy  to  check  via  a  simple  geometric 
argument  that  for  every  non-zero  repeated  real  scalar  sub  block,  the  quantity  ,  must  be  the 

same  modulo  ±,  (for  i  =  l,...,mr),  and  this  quantity  cannot  be  purely  real.  If  this  were  not 
so  then  once  again  we  could  find  Q  G  Qk  with  (>r{QM)  >  0.  These  conditions  are  equivalently 
expressed  as 

v*.uri  =  7 for  each  i  <  mr  with  \v*.uri\  ^  0  (59) 

where  for  each  i  we  have  ^  =  ±|,  0  <  7,  G  H,  and  ip  G  (— ■§  7).  Thus  applying  lemma  4.2  from  [9] 
to  each  of  these  blocks  we  have 

vTi  =  ej^e?6' DiUri  for  each  i  <  mr  with  \v*MTt  \  ^  0  (60) 
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where  for  each  i  we  have  0  <  Di  =  D*.  Stacking  these  relationships  up  we  obtain  Dj  G  Gk  such 
that 

u  =  j  ^Dju  (61) 

Note  that  for  the  zero  sub  blocks  of  u  and  v  this  relationship  holds  trivially,  and  for  the  non-zero 
sub  blocks  it  follows  from  (60).  Now  choose  Dr  =  0„  and  G  =  —Dj.  Note  that  Dr  G  T>k,  Dr  >  0, 
G  G  Gk.  ,  and  for  any  0  <  e  G  R  we  have 

M*DrM  +  j (GM  -  M*G )  -  eDR 
=  j  (-Djuv*  +  vu*Dj) 

=  —2  cos  (x)j)vv*  <  0  (62) 

where  we  have  used  the  substitution  j Dfu.  =  e~^v.  Now  define  D  =  Dr  +  tln  =  tln ,  where 
0  <  t  G  R  will  be  chosen  later,  and  note  that  for  t  >  0,  D  G  Then  we  have  that 

(~M*Dr~M  +  j  (GM  -  M*G )  -  c£>r) 
t  (M*W  -  dn) 

A  +  tB.  (63) 

By  our  earlier  results  A  <  0  and  furthermore  we  have 

r/*Ar)  =  —2  cos(V>)|u*  r)\2  <  0  (64) 

with  equality  iff  v*r)  =  0  (since  cos(V’)  >  0).  For  such  rj  we  have  that 

r]*Br]  =  V*  (M*M  -  €/„)  y 
—  t]*  (v  u*u  v*  —  eln )  77 

=  —  c|?7|2  <  0  for  r]  7^  0  (65) 

Thus  applying  lemma  1  we  obtain  that  A  +  tB  <  0  for  sufficiently  small  t  >  0.  Choosing  such  a  t 
we  have  that 

M*DM+j(GM-M*G)-eD  <  0  (66) 

with  D  G  T>k  and  G  G  Gk-  Now  we  can  unwrap  the  transformation  in  lemma  2  exactly  as  before 
to  obtain  D  G  V/c  and  G  G  Gk  satisfying 

M*DM  +j(GM  -M*G)-eD  <  0.  (67) 

But  this  shows  that  the  upper  bound  achieves  i/e  where  e  >  0  was  arbitrary.  Thus  by  taking 
the  infimum  we  have  that  the  upper  bound  achieves  0  which  by  assumption  equals  /ijc(M).  This 
concludes  the  hk(M )  =  0  case,  and  combining  this  with  our  earlier  result  for  ytc{M)  >  0  the 
theorem  is  proved.  □ 


M*DM  +  j(GM  -  M*G)  -  eD  = 

+ 
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5  Additional  Properties 

Note  that  this  proof  is  substantially  more  involved  than  the  one  given  in  [14],  where  additional 
simplifying  assumptions  were  made.  Note  however  that  this  is  a  constructive  proof,  and  so  it 
actually  gives  us  quite  a  bit  more  information  about  the  rank  one  problem  than  we  obtained  with 
the  earlier  result.  We  are  able  not  only  to  say  that  the  upper  bound  achieves  //,  but  to  explicitly 
construct  the  D,G  scaling  matrices  that  do  the  job.  This  allows  us  to  examine  the  properties 
of  these  scaling  matrices  as  a  function  of  the  problem  data,  and  to  arrive  at  several  interesting 
conclusions. 

First  note  that  the  construction  of  the  optimal  sequence  of  D ,  G  scaling  matrices  was  based 
on  employing  the  “//-values”  construction  from  [14].  Note  that  theorem  3  holds  for  any  of  the 
“//-values”  (see  the  remarks  following  theorem  3).  To  be  more  specific  this  means  that  given  any 
Q  £  Q/c  achieving  a  local  maximum  over  Q  £  BX/c  of  pr(QM),  with  pp(QM)  =  0  >  0  then 
we  can  employ  the  machinery  of  lemma  2,  and  theorems  2  and  3,  to  construct  (a  sequence  of) 
D  £  Vtc,G  £  Qk  verifying  that  p/c(M)  <  0.  But  since  we  already  have  0  —  pr(QM )  <  p/c(M) 
this  gives  p/c(M)  =  0.  Thus  we  find  that  for  the  rank  one  problem  any  non-zero  local  maximum 
of  pn(QM)  (as  defined  above)  is  global.  We  state  this  as  a  theorem. 

Theorem  4  Suppose  we  have  M  —  uv* ,  with  u,v  £  Cn,  and  a  compatible  block  structure  JC. 
Further  suppose  we  have  Q  £  Q/c  such  that  pp(QM )  =  0  >  0  is  a  local  maximum  of  pr(QM)  over 
Q  £  BX/c-  Then  0  =  pK (M). 

This  offers  further  insight  into  why  the  rank  one  problem  is  easy.  For  the  general  problem  we  do 
not  have  any  such  guarantees  about  local  maxima,  and  in  fact  one  can  easily  construct  problems 
with  local  maxima  that  are  not  global. 

In  fact  we  can  characterize  the  solution  to  the  rank  one  //  problem  in  terms  of  this  alignment 
condition. 

Theorem  5  Suppose  we  have  M  =  uv*,  with  u,v  £  C"  satisfying  the  non-degeneracy  assumptions, 
and  a  compatible  block  structure  K .  Further  suppose  we  have  Q  £  Q/c  with  q\  -f  0  for  i  =  1 , . . . ,  mr 
such  that  Pr{QM)  =  0  >  0.  Then  we  have  0  —  p/c{M)  iff  there  exists  D  £  V/c  with  6 /  =  for 
\ql\  <  1  and  if  £  (— f  f)  such  that 

v  =  DQu  (68) 

Proof:  (— >)  Note  that  as  in  the  proof  of  theorem  2  we  can  show  that  Qu  and  v  are  the  right  and  left 
eigenvectors  of  QM  corresponding  to  the  distinct  eigenvalue  0,  with  v*Qu  —  0.  Since  0  =  p/c{M) 
we  are  at  a  local  maximum  by  assumption  and  the  result  follows  from  theorem  2  of  [15]. 

(<— )  Again  we  have  that  Qu  and  v  are  the  right  and  left  eigenvectors  of  QM  corresponding  to  the 
distinct  eigenvalue  0,  with  v*Qu  =  0.  But  now  for  each  i  <  mT  we  have 

<09'  =  cos  (Of)  +  j  sin(0t) 

so  that  defining  Dr  £  V/c,  Dr  >  0  and  Dj  £  Q/c  by 


Dr(  =  cos (OQDi  and  Dit  =  sin (OQDi 

for 

*=  1,-. 

. . ,  mr 

=  Dmr+i 

for 

i  =  l,.. 

,.,mc 

dRmr+mc+i  —  dmr+mc+i 

for 

■ ,  mc 
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we  have  v  =  eJ^’( Dr  +  j Dj)Qu  as  in  theorem  3.  From  this  alignment  we  can  construct  (a  sequence 
of)  D  £  T>£,G  £  Gk,  showing  that  pjc(M)  <  f3  (as  in  theorem  2)  and  hence  (3  =  pc(M).  □ 

Roughly  speaking  this  theorem  says  that  Q  achieves  p  iff  it  aligns  v  and  Qu  (as  above).  This  is 
employed  in  [13]  to  develop  a  lower  bound  power  iteration  to  compute  a  lower  bound  for  p*;(M)  for 
general  mixed  p  problems.  Note  that  for  this  theorem  we  added  some  technical  assumptions.  These 
can  once  again  be  dealt  with  via  the  machinery  of  lemma  2,  but  since  the  main  use  of  this  theorem 
is  for  the  lower  bound  power  iteration  in  [13]  we  do  not  bother  with  this  added  complication. 

Recall  that  we  remarked  earlier  that  we  can  solve  for  the  D,  G  scaling  matrices  in  the  upper 
bound  without  resorting  to  actually  solving  the  associated  convex  optimization  problem.  In  order  to 
do  this  however  we  need  to  know  the  value  of  pjc(M),  and  the  associated  destabilizing  perturbation 
Q  £  Qtc  (if  there  is  one).  In  the  following  section  we  will  see  that  in  fact  we  can  obtain  both  of 
these  quantities  in  closed  form,  so  that  we  have  a  complete  solution  to  the  rank  one  p  problem  in 
closed  form.  Thus  we  can  easily  compute  all  the  relevant  quantities  without  ever  having  to  resort 
to  numerical  solution  of  an  optimization  problem. 

6  A  Graphical  Interpretation 

It  is  interesting  to  consider  a  graphical  interpretation  of  the  rank  one  mixed  p  problem,  in  the 
complex  plane.  Suppose  that  we  have  M  =  uv* ,  with  u,  v  £  C",  and  some  compatible  block 
structure  K.  Then  note  that  for  any  A  £  BXk;  and  0  <  a  £  R  we  have  that 

&et{aln  —  AM)  =  0 

v*A  u  =  a  (69) 

mr  mc  me 

J2S*Vr,Ur,  +J26iV*iUCi  +  £  VcA?  ‘ UCi  = 

2=1  4  =  1  4  =  1 

This  equation  forms  the  basis  of  our  graphical  interpretation.  If  we  think  of  the  components, 
SJv*.uri,  6fv*.uCi,  Vc  Afud,  as  vectors  in  the  complex  plane,  then  the  rank  one  mixed  p  problem 
simply  amounts  to  choosing  S£,  Sf,  Af  so  that  these  vectors  add  up  to  a  positive  real  number, 
which  is  as  large  as  possible.  Note  that  it  is  now  evident  that  we  do  not  affect  the  problem  if  we 
throw  away  the  degenerate  blocks  (with  \v*.uri  \  =  0,  or  \v*.uCi  \  =  0,  or  \vct  ||uc',  I  =  0). 

First  of  all  it  is  clear  that,  since  the  complex  perturbations  can  have  arbitrary  phase,  we  will 
always  wish  to  choose  |<$f |  =  1  and  <r(Af )  =  1  so  that  these  vectors  have  their  maximum  length, 
namely 

Siv*^ci  =  ^ci\v*.uCi\ 

vc\  Af  uCl  =  ^Ci\vCi\\uCi\  (70) 

Furthermore  it  is  also  clear  from  the  geometry  of  the  problem  that  in  fact  we  will  wish  to  align  all 
these  vectors,  so  as  to  make  one  vector  of  maximal  length  whose  phase  we  are  free  to  choose.  Thus 
we  may  take  i\)Ci  =  and  ipd  =  V’  (with  appropriate  ranges  for  i ),  and  so  we  have 

mc  mc  ( mc  mc  \ 

+  E ”cA? «c.  =  <?*  E K“«l  +  E l«Cill«o.-l  (n) 

4  =  1  4=1  \4=1  4  =  1  / 
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where  we  have  one  free  parameter,  ip,  left  to  choose  for  the  complex  blocks.  Note  that  it  is  easy  to 
choose  and  Af  so  that  we  satisfy  (71).  With  this  observation  denote  0  <  Lc  G  R  as 

mc  me 

Lc  =  ^2\v*tuc,  |  +  £|»Cil|ttCil  (72) 

i= 1  2=1 

and  the  rank  one  mixed  /i  problem  reduces  to  choosing  real  numbers  6'f  G  [-1  1]  and  ip  G  [-7T  vr] 
so  as  to  maximize  0  <  a  G  R  where 


mr 

Y/6y;iuri  +  ^Lc  =  a  (73) 

2=1 

The  solution  to  this  problem  can  be  obtained  geometrically,  by  thinking  of  (73)  as  a  vector  sum  in 
the  complex  plane.  See  figure  1  for  an  example  illustration  with  three  real  blocks  (and  any  number 
of  complex  blocks).  Note  that  having  chosen  the  values  of  S[  then  it  is  easy  to  choose  ip  so  as 


Im 


Figure  1:  Graphical  solution  of  the  rank  one  mixed  fi  problem 


to  maximize  a,  or  alternately  to  verify  that  no  ip  exists  to  make  the  summation  in  (73)  add  up  to 
a  real  number.  The  real  question  then  is  how  to  choose  the  values  of  S[.  Consider  the  following 
algorithm  (some  of  the  key  ideas  for  this  approach  are  from  Newlin  [8]). 
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Algorithm  1  (Rank  One  p  Solution) 

1  Choose  starting  values  for  the  real  perturbations  as  S'-  —  Sgn(Re(t>*  ur% )).  Then  for  all  Sf  we 
have  Re(S;V*.uri)  >  0.  Now  compute 


'  mT 


S  =  Sgn  Imag  ]T  8iv*, 


Ki=l 


2  Rank  all  the  components  S( v*.  uri  by  argument,  so  that  the  highest  rank  is  assigned  to  the 
greatest  value  of  krg(S S'- v*.uTi). 

3  Consider  the  highest  rank  component  which  has  not  yet  been  looked  at.  Compute  Sopt,  which 
is  the  optimal  value  of  this  S\,  for  Sopt  £  R  unrestricted  in  sign  or  magnitude,  and  all  the 
other  real  perturbations  fixed. 

4  If  Sgn(Sopt)  =  — Sgn(7>[)  and  \Sopt\  >  1  and  not  all  the  components  have  been  looked  at,  then 
set  F  =  1,  else  set  F  =  0.  Now  reassign  S f  with  max[— 1  min[l  Sopt\\. 

5  If  F  =  1  then  go  to  step  3. 

6  For  these  values  of  S'-  compute  the  optimal  value  of  ip,  and  hence  p/cfM)  and  the  destabilizing 
perturbation  Q  £  <2/c  (if  there  is  one). 

This  algorithm  guarantees  to  compute  p  exactly  for  a  rank  one  mixed  p  problem,  together  with 
an  optimal  destabilizing  perturbation  Q  £  Q&  (if  there  is  one).  The  reasoning  behind  this  algorithm 
is  simple  if  one  thinks  of  the  problem  geometrically.  The  objective  is  to  make  the  summation 

mr 

Y28<V*r,Ur,+^Lc 

i= 1 

add  up  to  a  real  number  which  is  as  large  as  possible.  First  of  all  one  chooses  the  perturbations,  S(, 
so  that  the  real  parts  of  each  component,  Slv*.uri,  are  nonnegative.  Then  consider  the  summation 


mr 

Y,S»r, 


2=1 


(74) 


to  which  we  must  add  the  complex  component,  ei^Lc,  so  as  to  make  it  add  up  to  a  real  number. 
Suppose  the  imaginary  part  of  the  summation  in  (74)  is  nonnegative  (symmetric  arguments  apply 
for  the  other  case).  Then  we  consider  the  component  <S[u*.wr.  with  the  largest  argument.  This 
component  is  ranked  the  “worst”  component  in  the  summation,  in  the  sense  that  it  contributes  the 
most  positive  imaginary  part  (which  we  have  too  much  of)  for  a  given  positive  real  part  (which 
we  want).  Then  we  compute  the  optimal  value,  Sopt,  for  this  parameter,  SJ  (with  all  the  other  real 
perturbations  fixed),  and  reassign  S\  with  the  value  of  Sopt,  clipped  to  the  interval  [—1  1],  If  Sopt 
is  not  both  of  opposite  sign  to  the  original  value  of  SJ,  and  greater  than  one,  then  it  says  that 
you  could  not  improve  the  summation  by  further  reducing  the  imaginary  contribution  from  this 
component,  Slv*.uri.  But  this  component  had  the  “worst”  ratio  of  imaginary  to  real  contribution, 
so  you  could  not  improve  by  changing  any  other  component,  and  hence  you  are  done.  If  the  above 
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condition  is  not  met  then  you  could  get  further  improvement  with  this  component  so  you  check  the 
next  rank  component,  until  you  meet  the  condition  or  you  have  checked  them  all.  In  this  way  you 
proceed  with  at  most  a  linear  search  over  the  real  parameters  to  obtain  the  optimal  values  for  all 
the  real  perturbations,  S'- .  Given  these,  then  it  is  easy  to  compute  the  remainder  of  the  solution. 

It  is  easy  to  verify  that  the  computation  of  8opt  in  step  3  and  ip  in  step  6  boil  down  to  just 
simple  trigonometry.  It  can  also  be  verified  that  we  always  have  ip  G  (— f  f )  and  denoting  by  0,t 
the  angle  between  e^Lc  and  8^v*.uri,  then  E  [—  f  |].  Furthermore  if  \8\\  <  1  then  0t  =  ±f. 
But  now  if  we  examine  the  vector  summation  in  figure  1  then  we  see  that  this  solution  is  exactly 
the  alignment  condition  derived  in  theorem  3.  From  the  geometric  viewpoint  it  is  now  clear  why 
this  alignment  must  hold  at  the  maximum  of  pr(QM). 

Note  that  algorithm  1  requires  at  most  a  search  over  the  real  parameters,  which  grows  linearly 
with  mr.  All  the  computations  required  can  be  performed  via  simple  trigonometry,  so  that  this 
algorithm  is  really  a  shorthand  notation  for  the  closed  form  solution  to  the  rank  one  mixed  p 
problem  (which  would  otherwise  be  cumbersome  to  write,  involving  a  “max”  over  mr  possibilities). 
Thus  we  have  a  closed  form  solution,  with  trivial  computational  requirements,  for  both  p/c(M)  and 
the  associated  Q  G  Qk- 

The  fact  that  the  graphical  solution  to  the  rank  one  p  problem  is  so  simple,  really  lets  us  see 
what  is  going  on  with  the  problem.  As  an  illustration  of  this  we  immediately  obtain  the  following 
theorem. 

Theorem  6  Suppose  we  have  M  =  uv* ,  with  u,v  G  C”,  and  a  compatible  block  structure  1C.  Then 
in  computing  p/c(M)  it  suffices  to  consider  perturbations  A  G  Xtc  with  at  most  one  of  the  real 
variables,  S'-  having  \8r{  \  <  1. 

Proof:  Note  that  algorithm  1  starts  out  by  assigning  all  the  real  variables  at  extremal  values.  It 
can  be  seen  from  steps  4  and  5  that  the  algorithm  quits  if  ever  any  variable  is  reassigned  internally. 
Since  algorithm  1  guarantees  to  find  p>c(M )  we  find  that  an  optimal  destabilizing  perturbation  can 
be  found  with  at  most  one  real  variable  internal.  □ 


This  is  the  mixed  p  counterpart  of  the  well  known  “edge  theorem”  [1]  for  the  “affine  parameter 
variation”  case  for  a  polynomial  with  perturbed  coefficients  (see  section  3).  Note  that  the  result 
holds  for  pure  real  or  mixed  p  problems.  Once  again  the  reason  for  this  result  is  clear  when  we 
look  at  the  problem  geometrically:  if  we  have  more  than  one  variable  internal  then  we  can  always 
increase  one  of  them  (in  magnitude),  and  compensate  with  the  other  so  that  the  summation  (73) 
stays  real  and  does  not  decrease.  We  simply  do  this  until  all  but  one  (or  none)  of  the  real  variables 
is  at  its  extremal  value. 


In  fact  if  one  considers  the  geometry  of  the  problem,  then  it  is  possible  to  state  a  slightly 
stronger  version  of  this  “edge  result”:  aside  from  cases  where  we  have  real  degenerate  blocks  (with 
| v*.uri\  =  0),  or  real  blocks  with  the  same  phase  modulo  ±  (i.  e. ,  krg(v*turi)  —  Arg(u*  urj)  or 
kxg(vp.urj)  +  7r  for  i  /  j),  then  the  only  optimal  destabilizing  perturbations  are  on  the  edges. 

Note  that  an  exact  expression  for  pjc(M)  with  M  rank  one  was  also  obtained  in  [4]).  The 
authors  were  then  able  to  take  this  result  and  solve  several  problems  from  the  literature,  noting 
that  these  problems  can  be  treated  as  special  cases  of  rank  one  p  problems. 

Thinking  about  the  rank  one  p  problem  graphically  makes  it  easy  to  construct  examples  with  a 
particular  value  for  p/c(M),  and  particular  properties  for  the  alignment  condition  at  the  maximum 
of  Pr(QM).  To  conclude  we  present  a  series  of  such  examples  which  illustrate  certain  facts  about 
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p  and  rank  one  problems.  These  facts  may  not  be  obviously  true  (or  false)  from  the  definition  of 
p,  but  are  immediately  clear  when  one  considers  the  graphical  interpretation  for  the  corresponding 
rank  one  example. 

Fact  1:  For  a  rank  one  mixed  p  problem,  it  is  not  generic  that  the  worst  case  perturbation  is 
on  a  vertex.  Consider  the  following  example: 

M=(j)(ll)  A  =  diag(<T,«n 

with  6r  €  R  and  Sc  €  C.  Then  it  is  easy  to  see  that  the  worst  case  perturbation  is  Sr  =  0,8C  —  1. 
This  has  Sr  internal,  and  this  property  holds  for  small  perturbations  to  the  problem. 

Fact  2:  For  a  rank  one  mixed  p  problem,  it  is  not  generic  that  the  worst  case  perturbation  is 
not  on  a  vertex.  Consider  the  following  example: 

Af=(})(ll)  A  =  diag(*r,tfc) 

with  (5r  6  R  and  6C  €  C.  Then  it  is  easy  to  see  that  the  worst  case  perturbation  is  Sr  —  1,6C  =  1. 
This  has  8r  at  a  vertex,  and  this  property  holds  for  small  perturbations  to  the  problem. 

Fact  3:  For  a  rank  one  pure  real  p  problem,  it  is  generic  that  the  worst  case  perturbation  is 
not  on  a  vertex.  This  follows  by  noting  that  given  any  summation  as  in  (74),  then  we  can  perturb 
the  components  by  an  arbitrarily  small  amount  so  that  the  summation  cannot  be  made  purely  real 
with  every  <5[  =  ±1. 

Fact  4:  For  a  rank  one  pure  real  or  mixed  p  problem,  with  at  least  two  uncertainty  blocks,  it 
is  generic  that  prc(M )  >  0.  This  follows  by  noting  that  for  any  problem  with  at  least  two  blocks 
we  can  perturb  the  components  by  an  arbitrarily  small  amount  so  that  the  summation  (73)  can  be 
made  real  and  positive. 

Fact  5:  One  can  have  p  problems,  where  the  worst  case  perturbation  has  all  the  real  variables 
internal,  or  even  zero.  This  follows  from  the  example  given  in  fact  1. 

7  Conclusion 

It  has  been  shown  that  the  “affine  parameter  variation”  problem  for  a  polynomial  with  perturbed 
coefficients  can  be  recast  as  a  rank  one  mixed  p  problem.  This  setting  forms  the  basis  for  a  number 
of  “Kharitonov-type”  exact  robust  stability  tests  with  respect  to  real  parametric  uncertainty.  This 
rank  one  mixed  p  problem  has  been  shown  to  be  equivalent  to  its  upper  bound,  which  is  a  convex 
problem.  This  enables  exact  computation  in  the  p  framework  as  well,  and  in  fact  a  closed  form 
solution  to  the  rank  one  p  problem  was  obtained  with  trivial  computational  requirements. 
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